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REMARKS 

Claims 2-4, 6-8, 10-12, 14-16, 18-20, 22-24, 26-28, 30-32, 34-36, 38-40, 42-44, 46-48, 
50-54, and 56-65 are withdrawn as being directed to non-elected subject matter. Claims 5, 9, 13, 
17, 21, 25, 29, 33, 37, and 41 are withdrawn as being directed to non-elected amino acid of SEQ 
ID Nos: 4, 6, 8, 10, 12, 14, 16, 18, 20, and 22, respectively. Claim 49 has been cancelled without 
prejudice or disclaimer. Claim 1 has been amended to recite an isolated protein comprising "the 
amino acid sequence of SEQ ID NO: 2 or a salt thereof." Support for the amendment can be 
found in the specification, for example, at least at p. 17, lines 27-29. Claim 45 has been 
amended to correct the case of the word "claim" and to recite a "composition for modulating the 
differentiation of and/or metabolism in an adipocyte comprising the protein or a salt thereof." 
Support for the amendment can be found, for example, at least at the paragraph bridging p. 70 
and p. 71 and at Example 4 at p. 149, line 6 to p. 150, line 7. Claim 55 has been amended to 
correct the case of the word "claim" and to delete the language "its partial peptide." Claims 66- 
67 have been added. Support for claim 66 can be found throughout the specification, for 
example, at least at the paragraph bridging p. 70 and p. 71 and at Example 4 at p. 149, line 6 to p. 
150, line 7. Support for claim 67 can be found throughout the specification, for example, at least 
at p. 59, line 4 to p. 63, line 1, p. 69, line 33 to p. 71, line 4, and at Example 4 at p. 149, line 6 to 
p. 150, line 7. Thus, the amended and new claims are fully supported by the specification and 
contain no new matter. 

Applicants respectfully request entry of the amendments. Once entered, claims under 
consideration are claims 1, 45, 55, and 66-67. 



-19- 



U.S. Application No. 10/520,783 
Attorney Docket No. 10525.0014-00000 



IDS 

Applicants note with appreciation that "the references cited in the IDS filed 8/9/05, the 
IDS filed 1/10/05 and the IDS filed 6/5/07 have been considered by the Examiner." Office 
Action at p. 2. Applicants note that references WO 01/57188, WO 01/00638, WO 01/64835, and 
WO 01/48192 in the IDS filed August 9, 2005, have not been initialed. Applicants respectfully 
request consideration of those references and that the Office initial those references after they 
have been considered. 

Objections to the Specification 

The Office has objected to the disclosure. 

First, the Office has required that the continuing data of this application should be 
updated. Accordingly, Applicants have amended the specification to include the priority data. 

Second, the Office has required that Applicants "delete the embedded hyperlink and/or 
other forms of browser executable code." Office Action at p. 3. According to the Office, 
"Applicants may remove "http://" so that the hyperlinker becomes inactive, i.e., the content 
followed by Http:// can be and should be left behind; and thus, a browser will only interpret the 
rest of the URL as text." Id. Accordingly, Applicants have amended the specification to remove 
the term "http://." 

Third, the Office has required that 

at page 34, line 9, "RACE" should be spelled out in full for the first 
instance of use; see also page 59, line 19-21, "PACAP", "GHRH", 
"CRF", "ATCH" "GRP", "PTH"; page 59, lines 25-26, "GRO", 
"NAP", "ENA", "PF4" "IP10", "GCP", "MCP", "HC14", "MIP" 
and "RANTES". 

Office Action at p. 3. 
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Each of these acronyms are disclosed in the specification and their full names are well- 
known in the art. Nonetheless, Applicants have amended the specification to spell out each 
name. For example, in the second full paragraph at p. 34, the specification discloses "RACE," 
which is known in the art as "rapid amplification of cDNA ends." See p. 8998, left column of 
Frohman et al. (Proceedings of the National Academy of Sciences USA 85:8998-9002, 1988) 
(copy enclosed). 

At the paragraph bridging pages 59 and 60, the specification discloses "PACAP," 
"GHRH," "CRF," "ACTH," "GRP," "PTH," "TRH," "GRO," "NAP," "EN A," "PF4," "IP 10," 
"GCP," "MCP," "MIP," "RANTES," and "IL." "PACAP" and "GHRH" are known in the art as 
"pituitary adenylate cyclase-activating polypeptide" and "growth hormone-releasing hormone," 
respectively. See Abstract of Montero et al. (Journal of Molecular Endocrinology 25:157-168, 
2000) (copy enclosed). "CRF" and "ACTH" are known in the art as "corticotropin-releasing 
factor" and "adrenocorticotropin hormone." See Abstract and p. 212, right column, second full 
paragraph in Sarnyai et al. (Pharmacological Reviews 53(2):209-243, 2001) (copy enclosed). 
"GRP" is known in the art as "gastrin-releasing peptide." See Abstract of Wada et al. (The 
Journal ofNeuroscience 10(9):29 17-2930, 1990) (copy enclosed). "PTH" is known in the art as 
"parathyroid hormone." See Abstract of Mannstadt et al. (American Journal of Physiology- 
Renal Physiology 277:665-675, 1999) (copy enclosed). "TRH" is known in the art as 
"thyrotropin-releasing hormone." See Summary of Bilek (Physiological Research 49 
(Suppl.l):S19-S26, 2000) (copy enclosed). "GRO," "NAP," "ENA," "PF4," "IP10," "GCP," 
"MCP," "MIP," "RANTES," and "IL" are known in the art as "growth-related oncogene," 
"neutrophil-activating protein," "epithelial cell-derived neutrophil-activating factor," "platelet 
factor 4," "interferon-inducible protein 10," "granulocyte chemoattractant protein," "monocyte 
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chemoattractant protein," "macrophage inflammatory protein," "regulated on activation normal 
T cell expressed and secreted," and "interleukin." See Table 1 at p. 148 in Murphy et al. 
(Pharmacological Reviews 52 (1): 145-176, 2000) (copy enclosed). 

In the case of "HC14," Applicants note that "[differential library screening with cDNA 
probes derived from stimulated versus resting PBL led to identification of two novel sequences, 
termed HC1 1 and HC14." Abstract of Chang et al. (International Immunology l(4):388-397, 
1989) (copy enclosed). Thus, according to Chang et al., the factor "HC14" is a term known in 
the art. Therefore, Applicants have not spelled out HC14 in the specification. 

Because one skilled in the art would understand what these acronyms represent, no new 
matter has been added. Withdrawal of the objection is respectfully requested. 

Amendment to Correct a Typographical Error in the Specification 

Applicants have amended the paragraph bridging page 16 and page 17 to correct a 
typographical error in the word "Edman." Thus, no new matter has been added. 

Rejections Under 35 U.S.C. § 112, Second Paragraph 

Claims 1, 45, 49, and 55 are rejected under 35 U.S.C. § 112, second paragraph, as 
allegedly "being indefinite for failing to particularly point out and distinctly claim the subject 
matter which applicant regards as the invention." Office Action at p. 3. 

First, the Office states that "[cjlaim 1 recites 'substantially the same as the amino acid 
sequence...'; the recitation is unclear in the metes and bounds of the relationship between the 
claimed protein and SEQ ID NO:2 in terms of the claim language 'substantially the same as. . . "' 
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Office Action at p. 4. Second, the Office states that claim 1 is "indefinite in the recitation of 
'represented by. '...Claims 45, 49, and 55, which depend from claim 1, are also rejected." Id. 

Applicants respectfully traverse. Amended claim 1 recites "an isolated protein 
comprising the amino acid sequence of SEQ ID NO:2 or a salt thereof." Therefore, withdrawal 
of the rejection is respectfully requested. 



Rejections Under 35 U.S.C. $ 112, First Paragraph 

Written description 

The Office has rejected claims 1, 45, 49, and 55 under 35 U.S.C. § 1 12, first paragraph, 
as allegedly failing to comply with the written description requirement. According to the Office, 
"the specification contains subject matter which was not described in the specification in such a 
way as to reasonably convey to one skilled in the relevant art that the inventor(s), at the time the 
application was filed, had possession of the claimed invention." Office Action at p. 5. 

The Office states that "[cjlaim 1 is directed to a genus of isolated proteins comprising a 
polypeptide which is substantially the same as the amino acid sequence or a partial sequence 
thereof." Office Action at p. 5. 

The Office further states that: 

[t]he specification neither provides representative species to 
describe the 'partial sequence' or the sequence which is 
substantially homologue to the instant SEQ ID NO:2. . .nor 
provides animal model for the disclosed pharmaceutical 
composition (claims 45 and 49) comprising (i) the polypeptide of 
SEQ ID No: 2 or (ii) the 'partial peptide' or (iii) the variant 
sequence substantially homologue to SEQ ID NO:2 wherein (i), 
(ii), or (iii) is used as a prophylactic (preventative) and/or 
therapeutic agent for disease involving abnormal adipocytes 
differentiation (claim 49). Both the art in related field and the 
instant disclosure fail to teach this regard. Therefore, applicants 
are not in possession of the claimed amino acid sequence 
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comprising variant polypeptide or oligopeptide which is the partial 
sequence of the full-length SEQ ID NO:2 (claim 1), the 
pharmaceutical composition (claims 45 and 49) and the kit (claim 
55) comprising the sequence thereof. 

Office Action at p. 7. 

Applicants respectfully traverse. Amended claim 1 recites "an isolated protein 
comprising the amino acid sequence of SEQ ID NO:2 or a salt thereof." 

Amended claim 45 recites "a composition for modulating the differentiation of and/or 
metabolism in an adipocyte comprising the protein or a salt thereof according to claim 1, and a 
pharmaceutically acceptable carrier, recipient or diluent." 

Claim 49 has been cancelled without prejudice or disclaimer. Moreover, amended claim 
55 no longer recites "its partial peptide." 

Therefore, withdrawal of the rejection is respectfully requested. 
Enablement 

Claims 1, 45, 49, and 55 are also rejected under 35 U.S.C. 112, first paragraph allegedly 
because "the specification while being enabling for the isolated protein of SEQ ID NO:2 and a 
composition thereof, does not reasonably provide enablement for all proteins encompassed by 
the claims and the pharmaceutical composition and kits thereof. The specification does not 
enable any person skilled in the art to which it pertains, or with which it is most clearly 
connected, to make and use the invention commensurate in scope with these claims." Office 
Action at p. 8. 

The Office states that: 

claims are directed to the isolated variant polypeptide/peptide 
which is substantially the same as the amino acid sequence or 
partial sequence (claim l)....The specification does not teach that 
the variant polypeptide or oligopeptide has the activity of the full- 
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length sequence of SEQ ID NO: 2, nor provides working examples 
thereof, and the claims do not require that the claimed protein have 
any particular function or activity. Also the specification fails to 
provide the working example or animal model for the claimed 
pharmaceutical composition comprising the polypeptide of SEQ 
ID NO:2 or the variant molecule thereof, and for the kit comprising 
the variant molecule thereof; where the scope of pharmaceutical 
composition is so broad as to prevent (prophylatic) or treat any 
disease involving abnormality of adipocytes differentiation and/or 
metabolism function. 

Office Action at p. 9. 

Applicants respectfully traverse. 

Amended claim 1 recites "an isolated protein comprising the amino acid sequence of 
SEQ ID NO:2 or a salt thereof." 

Amended claim 45 recites "a composition for modulating the differentiation of and/or 
metabolism in an adipocyte comprising the protein or a salt thereof according to claim 1 , and a 
pharmaceutical^ acceptable carrier, recipient or diluent." 

Claim 49 has been cancelled without prejudice or disclaimer. Moreover, amended claim 
55 no longer recites "its partial peptide." 

Therefore, withdrawal of the rejection is respectfully requested. 



Rejections Under 35 U.S.C. § 102 

102(b) 

Claim 1 has been rejected under 35 U.S.C. 102(b) as allegedly being anticipated by 
Wood et al. (WO 9914328). The Office states that "[i]n patent claim 12, Wood et al. teach an 
isolated polypeptide comprising SEQ ID NO:3 10... which has 90.4% sequence identity to instant 
SEQ ID NO:2. . ., which anticipate instant claim 1 ." Office Action p. 12. 
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Applicants respectfully traverse. 

Amended claim 1 recites "an isolated protein comprising the amino acid sequence of 
SEQ ID NO:2 or a salt thereof." Wood et al. does not teach an isolated protein comprising the 
amino acid of SEQ ID NO:2 or a salt thereof. Thus, withdrawal of the rejection is respectfully 
requested. 



102(e) 

Claims 1, 45, 48, and 55 are rejected under 35 U.S.C. 102(e) as allegedly being 

anticipated by Goddard et al. (U.S. Patent No. 7,189,807). The Office states that 

Goddard et al. teach the isolated polypeptide comprising SEQ 
ID NO:370. . .which has 90.4% sequence identity to instant SEQ ID 
NO:2. The polypeptide is isolated because SEQ ID NO: 3 70 is 
encoded by isolated cDNA clone... because Goddard et al. teach 
the polypeptide is recombinantly isolated. Thus, Goddard et al. 
teach the instant claim 1 . 

Goddard et al. teach a pharmaceutical composition comprising 
PRO polypeptide (col. 6, lines 14-19) which is encoded by PR0332 
cDNA, i.e., PR0332 polypeptide, which consists of SEQ ID 
NO:332 (Figure 370, col. 24, lines 1-3). The Goddard et al. 
teachings anticipate instant claim 45. 

Since the recitation of claim 49 "which is a prophylactic and/or 
therapeutic agent for disease ..." is considered to be an intended 
use which has little patentable weight, the above Goddard et al. 
teachings anticipate instant claim 49. 

Claim 55, reciting a kit, as written, contains no positive 
recitation of the ingredients which distinguishes it over the 
references; therefore, the kit (a composition) is encompassed by 
the Goddard reference. 



Office Action at p. 12-13. 

Applicants respectfully traverse. 
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Amended claim 1 recites "an isolated protein comprising the amino acid sequence of 
SEQ ID NO:2 or a salt thereof," and claims 45 and 55 depend on claim 1. Moreover, claim 55 
no longer recites "its partial peptide," and claim 49 has been cancelled. Goddard et al. does not 
teach an isolated protein comprising the amino acid of SEQ ID NO:2 or a salt thereof. 
Therefore, Goddard et al. does not anticipate the claims. Withdrawal of the request is 
respectfully requested. 



Rejections Under 35 U.S.C. § 103(a) 

The Office states that "[c]laims 1 and 55 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Wood et al. (WO 9914328) in view of Ali et al. (US 20020037540 Al). The 
rejection of claim 1 by Wood et al. has been discussed above. Yet, Wood et al. do not expressly 
teach a kit comprising the polypeptide. At [0150], Ali et al. teach a kit comprising a polypeptide 
useful for identifying ligand, agonist or antagonist for said polypeptide, as applied to claim 55." 
Office Action at p. 14. 

The Office continues by stating: 

It would have been obvious to one of ordinary skill in the art at 
the time the invention was made to formulate the kit comprising 
the full-length polypeptide of instant SEQ ID NO:370. This is 
because the polypeptide of SEQ ID NO:370 (also called: PR0332 
polypeptide, see page 107, line 27) taught by Wood et al. is useful 
for identifying its ligand, e.g., receptor-ligand interaction (see page 
98, lines 28-36), and because Ali et al. have taught that the kit 
developed is useful for the same purpose, i.e., identifying the 
ligand for said polypeptide. Moreover, it is a well known 
convention in the art to place these components in a pack or kit for 
convenience and economy. Therefore, in order to conveniently 
screening for the ligand including agonist, antagonist or/and 
receptor of said polypeptide, one skilled in the art would have 
constructed the kit comprising the polypeptide according to the 
above references' teachings with reasonable expectation of success. 
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Therefore, the claimed invention was prima facie obvious to make 
and use the invention at the time it was made. 



Office Action at p. 14-15. 

Applicants respectfully disagree. Amended claim 1 recites "an isolated protein 
comprising the amino acid sequence of SEQ ID NO:2 or a salt thereof," and claim 55, which 
depends on claim 1, no longer recites "its partial peptide." As discussed above, Wood et al. does 
not teach or suggest an isolated protein comprising the amino acid sequence of SEQ ID NO:2 or 
a salt thereof. Ali et al. does not make up for the deficiences of Wood et al. Therefore, Wood et 
al. in view of Ali et al. do not render obvious claims 1 and 55. Withdrawal of the rejection is 
respectfully requested. 

In view of the foregoing amendments and remarks, Applicants respectfully request 
reconsideration and reexamination of this application and the timely allowance of the pending 
claims. 

Please grant any extensions of time required to enter this response and charge any 
additional required fees to our deposit account 06-0916. 



Respectfully submitted, 



FINNEGAN, HENDERSON, FARABOW, 
GARRETT & DUNNER, L.L.P. 



Dated: January 16, 2008 




Yukbponeoka 
Reg/No. 60,018 
(650) 849-6679 
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Rapid production of full-length cDNAs from rare transcripts: 
Amplification using a single gene-specific oligonucleotide primer 

(polymerase chain reaction/5' and 3' cDNA ends/cDNA cloning/low-abundance mRNAs/ int-2 gene) 

Michael A. Frohman, Michael K. Dush, and Gail R. Martin 



Department of Anatomy, University of California, San Francisco, CA 94143 
Communicated by Peter H. Duesberg, September 9, 1988 

ABSTRACT We have devised a simple and efficient cDNA 
doning strategy that overcomes many of the difficulties en- 
countered in obtaining full-length cDNA clones of low- 
abundance mRNAs. In essence, cDNAs are generated by using 
the DN A polymerase chain reaction technique to amplify copies 
of the region between a single point in the transcript and the 3' 
or 5' end. The minimum information required for this ampli- 
fication is a single short stretch of sequence within the mRNA 
to be cloned. Since the cDNAs can be produced in one day, 
examined by Southern blotting the next, and readily cloned, 
large numbers of full-length cDNA clones of rare transcripts 
can be rapidly produced. Moreover, separation of amplified 
cDNAs by gel electrophoresis allows precise selection by size 
prior to cloning and thus facilitates the isolation of cDNAs 
representing variant mRNAs, such as those produced by 
alternative splicing or by the use of alternative promoters. The 
efficacy of this method was demonstrated by isolating cDNA 
clones of mRNA from int-2, a mouse gene that expresses four 
different transcripts at low abundance, the longest of which is 
-2.9 kilobases. After <0.05% of the cDNAs produced had 
been screened, 29 independent int-2 clones were isolated. 
Sequence analysis demonstrated that the 3' and 5' ends of all 
four int-2 mRNAs were accurately represented by these clones. 



Despite the development of numerous cDNA cloning strat- 
egies (1-5), obtaining full-length cDNA copies of low- 
abundance mRNAs remains a formidable task. We describe 
here a simple, rapid, and efficient cDNA cloning strategy that 
is based on the DNA polymerase chain reaction (PCR) 
technique developed by Saiki et at. (6). PCR employs two 
oligonucleotide primers, one complementary to a sequence 
on the (+) strand and the other to a downstream sequence on 
the (-) strand. Reiterative cycles of denaturation, annealing, 
and extension are used to generate multiple copies of the 
DNA that lies between the two primers. PCR has been used 
for a variety of purposes (7-13), including the detection of 
allelic polymorphisms and of DNA sequences unique to rare 
cells in a population, as well as the cross-species isolation of 
homologous genes. The feature common to all applications of 
PCR to date has been the use of primers designed to match 
two known or presumed genomic or cDNA sequences. In 
contrast, we have devised an application of this technique 
that achieves amplification and cloning of the region between 
a single short sequence in a cDNA molecule and its unknown 
3' or 5' end. We demonstrate here the utility of this strategy, 
termed "rapid amplification of cDNA ends" (RACE), by 
using it to obtain clones representing transcripts of a gene, 
int-2, expressed at low abundance in the early mouse em- 
bryo. 



The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked "advertisement" 
in accordance with 18 U.S.C. §1734 solely to indicate this fact. 



METHODS 

3' -End Amplification of cDNAs. See Fig. 1 for the scheme. 
Reverse transcription. One microgram of poly(A) + RNA (ref. 
14; ref. 15, pp. 91-98) in 16.5 pi of water was heated at 65°C 
for 3 min, quenched on ice, added to 2 pi of 10 x RTC buffer 
(lx RTC buffer is 50 mM Tris-HCl, pH 8.15 at 41°C/6 mM 
MgCl 2 /40 mM KC1/1 mM dithiothreitol/each dNTP at 1.5 
mM), 0.25 pi (10 units) of RNasin (Promega Biotec, Madison, 
WI), 0.5 pi of (dT)i r adaptor (1 pg/pl), and 10 units of avian 
myeloblastosis virus reverse transcriptase (Life Sciences, 
Saint Petersburg, FL), and incubated for 2 hr at 41°C. The 
reaction mixture was diluted to 1 ml with TE (10 mM 
Tris-HCl, pH 7.5/1 mM EDTA) and stored at 4°C. 

Amplification. The cDNA pool (1 pi) and amplification 
(3 'amp) and adaptor primers (25 pmol each) in 50 pi of PCR 
cocktail [10% (vol/vol) dimethyl sulfoxide/1 x Taq polymer- 
ase buffer (New England Biolabs)/each dNTP at 1.5 mM] 
were denatured (5 min, 95°C) and cooled to 72°C. Then 2.5 
units of Thermits aquaticus (Taq) DNA polymerase (Perkin- 
Elmer-Cetus) was added and the mixture was overlaid with 
30 pA of mineral oil (Sigma) at 72°C and annealed at 50-58°C 
for 2 min. The cDNA was extended at 72°C for 40 min. Using 
a DNA Thermal Cycler (Perkin-Elmer-Cetus), we carried 
out 40 cycles of amplification by using a step program (94°C, 
40 sec; 50-58°C, 2 min; 72°C, 3 min), followed by a 15-min 
final extension at 72°C. 

5' -End Amplification of cDNAs. See Fig. 1. One microgram 
of poly(A) + RNA was reverse transcribed as described above 
exceptforthe addition of 20 /tCi(lCi = 37GBq)of [ 32 P]dCTP 
and the substitution of 20 pmol of 5RT primer for (dT) n - 
adaptor. Excess 5RT was removed as follows: the 20-pl 
cDNA pool was applied to a Bio-Gel A-5m (Bio-Rad) column 
(in a 2-ml serological pipette plugged with silane-treated glass 
wool) equilibrated with 0.05 x TE. Void volume (0.8 ml) and 
30 one-drop fractions were collected. Fractions -4 to +3 
relative to the first peak of radioactivity were pooled, 
concentrated by centrifugation under reduced pressure 
(Speedvac), and adjusted to 23 pi. For tailing, 1 pi of 6 mM 
dATP, 6 pi of 5x tailing buffer (Bethesda Research Labora- 
tories), and 15 units of terminal deoxynucleotidyl-transferase 
(Bethesda Research Laboratories) were added, and the 
mixture was incubated for 10 min at 37°C and heated for 15 
min at 65°C. The reaction mixture was diluted to 500 pi in TE 
and 1- to lO-p\ aliquots were used for amplification as 
described for the 3 '-end procedure, except for the substitu- 
tion of (dT)i7-adaptor (10 pmol), adaptor (25 pmol), and 
amplification (5 'amp, 25 pmol) primers. 

Southern and RNA Blot Analysis. Ten-microliter aliquots of 
RACE reaction products were separated by electrophoresis 
[1% agarose gel containing ethidium bromide (EtdBr) at 0.5 



Abbreviations: RACE, rapid amplification of cDNA ends; PCR, 
(DNA) polymerase chain reaction; 3'amp and 5 'amp, gene-specific 
primers for 3'- and 5'-end amplification; 5RT, gene-specific primer 
for reverse transcription; EtdBr, ethidium bromide. 
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Fig. 1. Schematic representation of the RACE protocol. Primers: ****TTTT (dT) 17 -adaptor, 5 '-G ACTCG AGTCG ACATC- 
G ATTTTTTTTTTTTTTTTT- 3 ' . This sequence contains the Xho I, Sal I, and Cla I recognition sites. ****, Adaptor, 5'- 
G ACTCG AGTCG ACATCG-3 ' . 3'amp (amp refers to amplification), specific to gene of interest, complementary to (-) strand. 5RT (RT refers to 
reverse transcription) and 5'amp, specific to gene of interest, complementary to (+) strand. Open rectangles represent DNA strands actively being 
synthesized; shaded rectangles represent DNA previously synthesized. At each step the diagram is simplified to illustrate only how the new product 
formed during the previous step is utilized. A (-) or (+) strand is designated as "truncated" (TR) when it is shorter than the original (-) or (+) 
strand, respectively. 



fig/ 'ml], transferred to GeneScreen (New England Nuclear; 
ref. 16), and hybridized at high stringency (17) with a 
32 P-labeled probe (Bethesda Research Laboratories nick- 
translation kit). RNA blot analysis was carried out as previ- 
ously described (17). 

Cloning and Sequencing of cDNAs. RACE products were 
transferred into TE by using spun column chromatography (ref. 
15, pp. 466-467), digested with restriction enzymes that rec- 
ognize sites in the adaptor (Cla I or Sal I) or int-2 sequences, and 
separated by electrophoresis. Regions of the gel containing 
specific products were isolated, and the DNA was extracted 
with Glassmilk (Bio 101, San Diego, CA) and cloned in a 
Bluescript vector (Stratagene, San Diego, CA). Plasmids with 
int-2 cDNA inserts were identified by colony lift hybridization 
(ref. 15, p. 324). Restriction analyses (ref. 15, p. 104) were 
carried out on plasmid DNA prepared by the alkaline lysis 
method (18). Mini-prep plasmid DNA was sequenced with 



Sequenase (United States Biochemicals, Cleveland), using the 
supplier's recommendations. 

It should be noted that whereas all 23 clones generated 
after Sal I cleavage of the adaptor contained an intact Sal I 
site, many of the clones generated after cleavage with Cla I 
were altered at the Cla I site and could no longer be cleaved 
by that enzyme (M.A.F., unpublished data). It is unknown 
whether such alteration was due to the positioning of the Cla 
I site immediately adjacent to the oligo(dT) sequence. 

RESULTS 

The RACE Protocol. To obtain cDNA 3' ends (Fig. 1), a 
RNA population is reverse transcribed to create a cDNA (-) 
strand. The primer used is a 35-base oligonucleotide with 17 
dT residues and an adaptor sequence containing three endo- 
nuclease recognition sites that are found infrequently in 
genomic DNA [(dT) 17 -adaptor; see legend to Fig. 1]. The 
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presence of the adaptor places a unique sequence at the 
unknown end of the cDNA. Next, a gene-specific amplifica- 
tion primer (3'amp) is annealed with a small portion of the 
first (-) strand product (generally <1 ng) and extended to 
generate a complementary second (+) strand. Reiterative 
PCR cycles using 3 'amp and adaptor primers then amplify the 
double-stranded cDNAs. Specificity of amplification is de- 
pendent on base-pairing of 3'amp only to molecules repre- 
senting the mRNA of interest. The adaptor primer is used 
instead of the (dT)i 7 -adaptor primer because initial experi- 
ments suggested that long stretches of dT residues do not 
base pair well at the temperatures used to prevent mismatch- 
ing of the specific primers. 

A similar strategy is used to obtain cDNA 5' ends (Fig. 1). 
The initial specificity is achieved by using a gene-specific 
primer for reverse transcription (5RT). The primer-extended 
products are then separated from excess primer and a poly(A) 
tail is added by using dATP and terminal deoxynucleotidyl- 
transferase. Second (+) strand synthesis is carried out with the 
(dTJn-adaptor primer. Finally, the products are amplified by 
using the adaptor primer and a gene-specific amplification 
primer (5'amp) consisting of a sequence located upstream of 
the extension primer sequence. This "internal" primer in- 
creases the specificity and efficiency of the amplification 
reaction; it should bind only to the cDNAs of interest, whereas 
5RT would bind to all (+) strand cDNAs present, including 
any generated through mismatched hybridization of 5RT 
during reverse transcription. Moreover, 5'amp would not bind 
to any residual 5RT that had been tailed. Such tailed 5RT, 
because of its small size and relatively large initial concentra- 
tion, would serve as a potent substrate for a PCR reaction that 
included the SRT and (dT) 17 -adaptor primers, and this would 
greatly decrease the efficiency of the cDNA amplification. 

After amplification, the products are analyzed by restric- 
tion and Southern blot analyses and cloned. The optimal 
cloning strategy utilizes one restriction endonuclease that 
cleaves within the adaptor sequence and a second that 
cleaves within the amplified region. This adds selectivity to 
the cloning process, since most of the nonspecific cDNA 
products will not be cleaved by the second enzyme and thus 
will not be cloned. Alternatively, the RACE products can be 
cloned intact by also incorporating a restriction site into the 
5' end of the gene-specific primer. Finally, full-length cDNA 
clones can be reconstructed from separate but overlapping 3' 
and 5' RACE products, or they can be synthesized by using 
primers whose sequences were obtained by analysis of the 
extreme 5' and 3' ends of the RACE products for PCR 
amplification of reverse-transcribed mRNA. 

Isolation of int-2 cDN A Clones by Using the RACE Protocol. 
We tested this scheme by generating cDNA copies of mRNA 
from int-2, a gene characterized by multiple transcripts 
expressed at very low abundance (=2 copies per cell, ref. 19), 
and whose complete sequence is known (20). Because we had 
recently isolated int-2 cDNA clones by conventional methods 
(21), we had a standard against which we could judge the 
results of the present study. On the basis of our previous 
results, we expected to obtain int-2 cDNA clones with 
alternative start and polyadenylylation sites (Fig. 2A). 

The first 3'amp primer used, 3'amp-D, was specific to the 
long int-2 mRNAs (Fig. 2A). The 3 '-end cDNA product of the 
amplification reaction was expected to be =1050 bp in length. 
DNA of the expected size was visible after EtdBr staining 
(Fig. 2B, lane 3'D), and its identity was confirmed by 
Southern blot analysis using an int-2 cDNA probe (Fig. 2C, 
lane 3'D). In addition, most samples contained int-2 cDNAs 
of unexpected size, which were subsequently determined to 
be single-stranded molecules (see below). In some circum- 
stances other reaction products were visible after EtdBr 
staining, but they did not hybridize to the int-2 probe; we 
found that such nonspecific PCR products could be mini- 



mized by optimizing the annealing temperature (M.A.F., 
unpublished data). EtdBr-staining bands were also visible in 
the control sample, but they did not hybridize with the int-2 
probe (Fig. 2 B and C, lane 3'C). 

The second 3'amp primer used, 3'amp-U, was common to 
both short and long classes (Fig. 2A) and thus was expected 
to generate cDNA products 480 and 1580 bp in length, 
respectively. Of these, only the 480-bp product was detect- 
able by staining with EtdBr (Fig. IB, lane 3'U), but the 
presence of both cDNAs was demonstrated by Southern blot 
analysis (Fig. 2C, lane 3'U). The observation that the 480-bp 
cDNA is = 10-fold more abundant than the 1580-bp cDNA is 
inconsistent with the fact that the short classes of mRNA are 
only =2-fold more abundant than the long ones (Fig. 2A). 
This finding, that the relative abundance of different cDNA 
products generated by multiple cycles of PCR may not 
accurately reflect the composition of the mRNA population, 
is presumably due to a decrease in PCR efficiency as 
extension length increases. 

When int-2 cDNA 5' ends were produced by using the RACE 
strategy, we expected =690- and =930-bp products represent- 
ing the 5' ends of the class B and the less abundant class A 
mRNAs, respectively (Fig. 2A). Although both fragments could 
be detected by Southern blot analysis, they were not always 
visible after EtdBr staining (Fig. 2 B and C, lane 5'P). 

The specific products of the 3'- and 5 '-end RACE reactions 
were isolated from agarose gels and cloned. The possibility 
afforded by the RACE protocol of separating different classes 
of cDNAs by gel electrophoresis prior to cloning was partic- 
ularly useful in obtaining cDNAs representing class A 
mRNAs, which would otherwise have been difficult to detect 
in the presence of the much more numerous class B clones. 
However, we did not further size-select the RACE products in 
either class, and therefore we were able to examine the range 
of cDNAs produced by this protocol. Together the clones 
obtained represented all four transcripts expressed from the 
int-2 gene, the longest of which is =2.9 kilobases (kb). 

The cDNA clones representing the 3' ends of the long and 
short classes of int-2 mRNA were expected to terminate at 
int-2 genomic positions 7508 and 6404, respectively. Se- 
quence analysis of four independent cDNA clones (three of 
the long and one of the short class) confirmed that the 
products obtained by the RACE method were identical to 
those obtained by conventional cDNA cloning methods (21). 
Previous studies suggested that the 5' ends of class A and B 
int-2 mRNAs begin around positions 938 and 1677/78 of the 
int-2 genomic sequence, respectively (21); however, single 
start sites are unlikely since the int-2 promoters lack TATA 
boxes, and promoters of this type usually initiate transcrip- 
tion at multiple sites (23-25). Sequence analysis suggested 
that the majority of the clones studied were full length: 11/16 
class B clones and 5/9 class A clones began at or within 21 
bp of the presumed start sites (Fig. 2D). A higher frequency 
of full-length cDNA clones presumably could have been 
obtained by precise size selection of the RACE products 
prior to cloning. 

In the course of developing and testing the RACE protocol, 
we discovered the following: 

(i) For some mRNAs (not int-2) not all of the 3' end 
cDNAs were full length. We found this to be the case when 
we characterized cDN A clones of transcripts from the mouse 
En-1 gene (17); sequence analysis of 3'-end En-1 cDNA 
clones suggested that in some cases the (dT) 17 -adaptor primer 
had bound to an A-rich region in the coding sequence 
(AAAGAAGAAAAAGAA) upstream of the poly(A) tail. 
This could be minimized by decreasing the concentration of 
the (dT) ir adaptor primer to 15 ng/ 'jig of mRNA (M.A.F., 
unpublished data). 

(if) Some of the specific reaction products appeared to be 
single-stranded cDNAs. These reaction products hybridized 
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transcripts found in these cells as previously defined (21). Primers used are depicted as hatched boxes with arrows. Primer sequences: 3'amp 
primers 3'amp-U (5'-GGCAGAAGAAGCAGAGC-3') and 3'amp-D (5'AGAAGGCAGTGATCTGC-3')> complementary to (-) strand; 5RT 
(5'-GGCTCTGCTTCTTCTGC-3') and 5'amp primer (5'-CTCATGGTCCTTGTGGC-3'), complementary to (+) strand, bp, Base pairs. (B and 
C) Analysis of int-2 RACE reaction products. Poly(A) + RNA isolated from endoderm-like cells obtained by treating F9 embryonal carcinoma 
cells with retinoic acid and dibutyryl-cAMP (22) was subjected to the RACE protocol. The reaction products were analyzed by electrophoresis 
and EtdBr staining (B) and Southern blot hybridization using an int-2 cDNA probe (C). Marker lane (M) contains a 123-bp ladder (Bethesda 
Research Laboratories); sample lanes are labeled with the name of the gene-specific primer used in the amplification reaction (3'D, 3'amp-D, 
3'U, 3'amp-U; 5'P, 5'amp) or as controls in which amplification was carried out in the absence of an int-2 primer (3'C, 5'C). An * indicates 
single-stranded int-2 cDNA. (D) Analysis of 5'-end int-2 cDNA clones. The thin lines represent int-2 genomic DNA and the open boxes the exons 
of the class A and B int-2 mRNAs. The 5' ends of the mRNAs are symbolized by dotted vertical lines to indicate that multiple start sites are 
used. The jagged lines indicate that only the 5' portion of exon 3 is shown. The filled dots indicate the positions of the 5' ends of RACE clones 
as determined by sequence analysis, and the open dots the 5' ends as determined by n 

to int-2 probes, but they did not cleave with appropriate 
restriction endonucleases and were eliminated by digestion 
with mung bean nuclease (Fig. 3A). The quantity of these 
single-stranded products was also decreased by altering the 
buffer conditions (Fig. 3A, lane B), suggesting that comple- 
mentary DNA strands are present and that appropriate 
binding can occur under suitable conditions. Single-stranded 
products are not usually observed with PCR, but use of a 
gene-specific primer in conjunction with a primer that binds 
to every cDNA present and that contains a long homopoly- 
meric stretch may lead to greater production of one of the 
strands, or to mismatched pairing of some strands after the 
final round of PCR. Because these single-stranded products 
cannot easily be cloned, it is important to determine which of 
the specific products are double stranded. 

(Hi) The relative amount of longer cDNA products could 
be dramatically increased in several cases by substantially 
increasing the extension time of the initial (+) strand syn- 
thesis reaction (Fig. 32?). We also noted that small changes in 
the annealing temperatures (2°C) could cause significant 
changes in the ratio of specific to nonspecific cDNA ampli- 
fication (M.A.F., unpublished data). In practice, we suggest 
using a conservative annealing temperature (SOT) for initial 



studies and raising it until the optimal temperature is reached 
for obtaining the cDNA product of interest, as judged by 
Southern blot analysis. 

DISCUSSION 

The RACE protocol described here provides an alternative to 
other cDNA cloning methods that is advantageous in many 
respects. A single gene-specific primer is used to generate 
3'-end clones, whose sequences can be used, if necessary, to 
design the primers required for 5'-end cloning. The protocol 
involves few steps and the only reagents that must be 
specially prepared are the oligonucleotide primers. More- 
over, steps common to other methods can be much simpler 
in the RACE protocol. For example, terminal deoxynucleo- 
tidyltransferase tailing conditions are generally optimized 
and monitored, since it is considered important to limit the 
length of the tail (4, 5). Using the RACE protocol, however, 
we found that the length of the poly(A) tail on the cDNA 
products did not exceed 34 nucleotides, even though no 
precautions were taken to limit tail length. 

The RACE protocol is rapid and can be executed in 1 day. 
Moreover, it is unique in that substantial information about 
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Fig. 3. (A) Sensitivity of specific RACE products to mung bean 
nuclease, int-2 cDNAs were generated by using the RACE 5'-end 
protocol and transferred into TE by spun column chromatography. 
The samples were then incubated at 30°C for 30 min with no additions 
(None), with buffer only [B: 30 mM NaOAc, pH 5/50 mM NaCl/1 
mM ZnCl 2 /5% (vol/vol) glycerol], with buffer + 0.2 unit of mung 
bean nuclease (0.2N), or with buffer + 1 unit of mung bean nuclease 
(IN), and analyzed by Southern blot hybridization using an int-2 
probe. The 690-bp fragment is the amplified class B 5'-end cDNA; 
* indicates single-stranded int-2 cDNA. (B) Effect of extension time 
on production of specific RACE products, int-2 cDNA was prepared 
by using the RACE 3' end protocol with 3'amp-U as described in 
Materials and Methods, except that the time allowed for the 
synthesis of the (+) strand during the first cycle of PCR was varied; 
in subsequent steps the extension time was 3 min. Samples were 
analyzed by Southern blotting. Lanes are designated according to the 
length of the extension time (3, 10, 20, and 40 min). The 480- and 
1580-bp fragments are j'-end cDNAs of the short and long int-2 
mRNAs, respectively; * indicates single-stranded int-2 cDNA. 

the cDNAs can be obtained prior to cloning and screening 
steps. Specifically, the RACE products can be examined by 
Southern blotting to determine whether the desired cDNAs 
are present and to evaluate the complexity of the cDNA 
population. This allows cDNAs representing unusual tran- 
scripts, such as those created by alternative splicing or by the 
use of alternative promoters, to be selected for cloning. 

The RACE protocol is more efficient than other methods. 
This is evident from a comparison of the results of cloning 
int-2 cDNAs by using conventional library construction vs. 
the RACE method. Starting with a total of 15 ptg of poly(A) + 
RNA, we constructed two cDNA libraries, screened 2.4 x 
10 6 recombinant phage, and isolated a total of 9 int-2 cDNAs 
(21). In the present study we started with 0.5 fig of poly(A) + 
RNA, generated and screened several hundred recombinant 
plasmids with <0.05% of the RACE products, and isolated 29 
independent int-2 cDNA clones. Sequence analysis demon- 
strated that the 3' and 5' ends of all of four int-2 mRNAs were 
accurately represented by the RACE products. Moreover, all 
of the 3 '-end clones terminated at known polyadenylylation 
sites and the majority of the 5 '-end RACE clones appeared to 
be full length. Thus, we have used this technique to readily 
obtain full-length cDNA clones representing transcripts 1.4 
to 2.9 kb long. As our data suggest, however, PCR amplifi- 
cation becomes less efficient with increasing fragment length. 
Therefore, in cases in which the generation of full-length 
cDNAs would require amplification over a region more than 
several kilobases in length, production of the desired cDNAs 
may be difficult. Nevertheless, in most cases it should be 
possible to devise strategies to overcome this limitation. 
Another potential problem is that a limited number of errors 
are introduced by the use of multiple rounds of amplification 
(ref. 7; =0.1%, unpublished data of M.A.F.). Therefore, the 
sequence of a cDNA clone obtained by the RACE method 
should be verified by comparison with sequences of several 
independent cDNA clones or genomic DNA. 

Other potential applications of this protocol are evident. 
For example, primers based on amino acid sequence might be 
adequate for RACE cDNA cloning, although our preliminary 
data suggest that the feasibility of this approach may be 
limited by nonspecific amplification due to primer mismatch. 



It should also be possible to use a modified RACE protocol 
to construct general cDNA libraries: one could reverse 
transcribe using (dT)rradaptor primer, tail the (-) strand 
products with G or C residues, generate a (+) strand with a 
different adaptor on its 5' end, and amplify the pooled 
cDNAs. Since the RACE protocol as presently described 
requires less than 1 ng of mRNA (=5000 copies of a 
low-abundance message) and this amount could be reduced 
considerably, it is conceivable that the construction of such 
general cDNA libraries ultimately could be carried out with 
the RNA from a single cell. 
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